ECHO CANCELLATION/SUPPRESSION AND DOUBLE-TALK 



DETECTION IN COMMUNICATION PATHS 
Field Of The Invention 

[001] The present invention relates generally to telecommunications and in 

particular to echo cancellation/suppression and double-talk detection in 
communication paths. 

5 

Background Of The Invention 

[002] Acoustic coupling in telephone device handsets is common and occurs 

when signals broadcast by the speaker of a telephone device are coupled to the 
microphone of the same telephone device. This acoustic coupling causes echo signals 

10 in the network over which the communication channel is established. The acoustic 
suppression of signals broadcast by the handset speaker before they are picked up by 
the handset microphone is variable and depends on a number of factors, such as for 
example, the pressure exerted on the user's ear by the handset speaker and how well 
the handset speaker seals the user's ear. The loss in power of these signals as a result 

15 of acoustic suppression can vary from about -50db to as low as -24db when the 
telephone device handset is placed on a hard surface. If delays in the network are 
short (i.e. 30ms), the normal acoustic suppression of signals broadcast by the handset 
speaker is generally sufficient to inhibit voice quality from being noticeably degraded 
by echo signals picked up by the handset microphone. However, if the delays in the 

20 network are significant, echo signals in the network due to acoustic coupling will 
noticeably degrade voice quality over the communication channel. As such, 
suppressing echo signals resulting from acoustic coupling in telephone devices is 
important. 

[003] Echo cancellers in telephone devices have been considered. A typical 

25 echo canceller attempts to model the transfer function of the echo signal path using a 
linear algorithm such as a Least-Mean-Squared (LMS) algorithm. The estimated echo 
signals generated by the echo canceller are subtracted from the echo signals picked up 
by the handset microphone. Differences between the estimated echo signals and the 
actual echo signals result in error signals, which are fed back to the echo canceller. 
30 Unfortunately, since the algorithm executed by the echo canceller is linear, the echo 
canceller cannot deal with non-linear effects and can only converge to a transfer 
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function that approximates the echo signals. As a result, residual echo signals 
propagate through the echo canceller to the network. 

[004] To deal with the deficiencies noted above, an echo suppressor 

has been considered and is described in U.K. Patent Application Serial No. 
5 9907102.9 filed on March 26, 1999. This echo suppressor includes a power level 

calculator that determines the power level of signals broadcast by the handset speaker. 
A mask generator is responsive to the power level calculator and generates masks that 
are a function of the determined power levels to suppress echo signals received by the 
handset microphone. To avoid noticeable switching effects in the presence of 
10 background noise, the echo suppressor can be conditioned to generate leaky masks. 
This is achieved by setting one or a few of the least significant bits (LSBs) in the 
masks to one (1) so that the lower bits of the received echo signals are leaked through 
the echo suppressor. 

[005] Although this echo suppressor works well to suppress echo, since the 

15 number of least significant bits in the masks that are set to one to leak echo signals is 
static, switching effects that degrade voice quality still result. Also, since the echo 
suppressor is active all the time with the number of zeros in the masks changing 
depending on the power level of the signals broadcast by the handset speaker, the 
echo suppressor degrades speech quality during double-talk situations. As will be 
20 appreciated, improvements in echo cancellation/suppression are desired. 

[006] It is therefore an object of the present invention to provide a novel 

echo suppressor and method for suppressing echoes in a communication path. 

Summary Of The Invention 
25 [007] According to one aspect of the present invention there is provided a 

method of suppressing an echo signal in a communication path comprising the steps 
of: 

monitoring a transmitted signal supplied to said communication path to 
determine an attribute thereof; 
30 generating a leaky mask based on said determined attribute and an 

attribute of a received echo signal; and 

partially cancelling said received echo signal using said leaky mask 
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[008] Preferably, the attribute of the transmitted signal is the power level 

thereof and the attribute of the received echo signal is the noise level thereof. In the 
preferred embodiment, the generating includes the steps of generating a suppression 
mask based on the power level of the transmitted signal; generating noise leaking bits 
5 based on the noise level of the received echo signal; and combining the suppression 
mask and the noise leaking bits to yield the leaky mask. 

[009] It is also preferred that the partially cancelling of step is inhibited in 

the presence of double-talk. Double-talk is declared when the power level of the 
received echo signal exceeds a threshold value. 
10 [010] In another embodiment, it is preferred that during the generating, an 

estimated echo signal is also generated based on the power level of the transmitted 
signal and that the partially cancelling step is performed after the estimated echo 
signal is subtracted from the received echo signal. 

[011] - According to another aspect of the present there is provided an echo 
1 5 suppressor to suppress an echo signal in a communication path comprising: 

a power level calculator determining the power level of a transmitted 
signal supplied to said communication path; and 

a mask generator responsive to said power level calculator and 
generating a leaky mask, said leaky mask being generated as a function of the 
20 determined power level and the noise level of a received echo signal, said leaky mask 
being applied to the received echo signal thereby to suppress partially said received 
echo signal. 

[012] According to yet another aspect of the present invention there is 

provided an echo suppressor to suppress an echo signal in a communication path 
25 comprising: 

a digital signal processor for determining the power level of a 
transmitted signal supplied to said communication path and for determining the noise 
level of a received echo signal, said digital signal processor generating a leaky mask 
based on the determined power level and the determined noise level; and 
30 a multiplier for combining said leaky mask and said received echo 

signal thereby to suppress partially said received echo signal. 



[013] According to still yet another aspect of the present invention there is 

provided in a telephone device including a handset having a speaker to broadcast 
incoming signals and a microphone to receive outgoing signals, an echo suppressor to 
suppress echo signals picked up by the microphone as a result of acoustic coupling 
between said speaker and microphone comprising: 

a power level calculator determining the power level of transmitted 
signals supplied to said communication path; and 

a mask generator responsive to said power level calculator and 
generating leaky masks, said leaky masks being generated as a function of the 
determined power level and the noise level of received echo signals, said leaky masks 
being applied to the received echo signals thereby to suppress partially said received 
echo signals. 

[014] According to still yet another aspect of the present invention there is 

provided an echo suppressor to suppress echo signals generated in a communication 
path comprising: 

an echo canceller in parallel with said communication path, said echo 
canceller having a transfer function approximating that of said communication path 
and generating estimated echo signals in response to transmitted signals supplied to 
said communication path, said echo canceller subtracting said estimated echo signals 
from echo signals received from said communication path to generate residual echo 
error signals; and 

a processor receiving said estimated echo signals and said residual 
echo error signals, said processor determining the power level of the transmitted 
signals supplied to said communication path generating leaky masks, said leaky 
masks being generated as a function of the determined power level of said transmitted 
signals and the noise level of received echo signals, said leaky masks being applied to 
the residual echo error signals thereby to suppress partially said residual echo error 
signals. 

[015] The present invention provides advantages in that since the echo 

suppressor generates adaptive leaky masks, echo suppression is achieved while 
maintaining voice quality and avoiding switching effects in the presence of 
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background noise. Also, since echo suppression is switched off in the presence of 
double-talk, voice quality is maintained in double-talk situations. 

Brief Description Of The Drawings 
5 [016] Embodiments of the present invention will now be described more 

fully with reference to the accompanying drawings in which: 

Figure 1 is a schematic diagram of a communication channel 
established between a pair of telephone devices over a network showing acoustic 
coupling in one of the telephone devices; 
10 Figure 2 is a schematic diagram of a telephone device including an 

echo suppressor in accordance with the present invention; 

Figure 3 is a graph showing an envelope calculated by the echo 
suppressor of Figure 2 following the power level of a signal broadcast by a telephone 
device handset speaker; and 
15 Figure 4 is a schematic diagram of a conventional echo canceller and a 

non-linear processor to suppress residual echo signals in accordance with the present 
invention. 

Detailed Description Of The Preferred Embodiment 
20 [017] Turning now to Figure 1, a communication channel established 

between a pair of telephone devices 12 and 14 over a network 16 is shown and is 
generally identified by reference numeral 10. As can be seen, when a communication 
channel is established between the telephone devices 12 and 14, acoustic signals 18 
broadcast by the handset speaker 20 of receiving telephone device 14 are acoustically 
25 coupled to the handset microphone 22 of the telephone device 14. The echo signals 
picked up by the handset microphone 22 as a result of the acoustic coupling cause 
echoes in the network 16, which degrade voice quality. If the delay in the network 16 
is long, such as for example 150ms, which may be caused by voice packetization and 
local area network (LAN) propagation delays, echoes in the network 1 6 as a result of 
30 the acoustic coupling become audile detracting from voice quality. 

[018] To suppress adaptively echo signals picked up by the handset 

microphone 22 as a result of acoustic coupling, an echo suppressor 32 in accordance 



with the present invention is provided in each of the telephone devices 12 and 14 
respectively. Figure 2 better illustrates telephone device 14. As can be seen, 
telephone device 14 includes a line receiver 40 coupled between the network 16 and 
an analog to digital converter 42. Analog to digital converter 42 provides digital 
5 output to the echo suppressor 32 and to a digital to analog converter 44. The digital to 
analog converter 44 conveys its analog output to a speaker driver 46, which amplifies 
the analog signals before they are broadcast by the handset speaker 20. 
[019] Coupled to the handset microphone 22 is a microphone receiver 48, 

which provides output to an analog to digital converter 50. Analog to digital 
10 converter 50 provides digital output to the echo suppressor 32. Echo suppressor 32 in 
turn supplies output to a digital to analog converter 52 that is coupled to the network 
16 via a line transmitter 54. 

[020] The echo suppressor 32 couples the handset speaker 20 and the handset 

microphone 22 of the telephone device 14. The echo suppressor 32 adaptively masks 

15 echo signals picked up by the handset microphone 22 to inhibit echo in the network 
16 using leaky masks to reduce switching effects in the presence of background noise. 
Specifically, the echo suppressor 32 generates suppressions mask based on the power 
level of signals broadcast by the handset speaker 20. This is due to the fact that 
typically, the larger the signals broadcast by the handset speaker 20, the larger the 

20 echo signals picked up by the handset microphone 22 as a result of acoustic coupling 
tend to be. The echo suppressor 32 also generates noise leaking least significant bits 
based on the noise level of the received echo signals that are needed to leak 
background noise through the echo suppressor 32. The suppression masks and the 
noise leaking LSBs are combined to yield echo suppression adaptive masks that are 

25 used by the echo suppressor 32 to suppress the received echo signals. 

[021] In addition, the echo suppressor 32 performs double-talk detection 

based on the power level of the received echo signals and the acoustic coupling loss 
between the handset speaker 20 and the handset microphone 22. When double-talk is 
declared, the bits of the adaptive masks generated by the echo suppressor 32 are all set 

30 to one (1) allowing the received echo signals to pass through the echo suppressor 32 
unchanged. 



[022] In the present embodiment, the echo suppressor 32 is embodied in a 

programmable digital signal processor (DSP) 60. The DSP 60 executes an echo 
suppression algorithm, an echo signal noise level calculation algorithm and a double- 
talk detection algorithm to achieve the functionality described above. Further 
5 specifics concerning the echo suppression, echo signal noise level calculation and 

double-talk detection algorithms will now be described. During execution of the echo 
suppression algorithm, the DSP 60 performs a power level calculation to determine 
the power level of the signal received by the telephone device 14 that is to be 
broadcast by the handset speaker 20 and uses the determined power level to generate 

10 the suppression mask. 

[023] During execution of the echo signal noise level calculation routine, the 

DSP 60 calculates the noise level of the received echo signal using the method 
described in U.K. Patent Application Serial No. 0023975.6 filed on September 30, 
2000, the content of which is incorporated herein by reference. The value of the 

15 calculated echo signal noise level is then used to determine the number of noise 
leaking bits required to leak effectively the background noise and avoid switching 
effects. The suppression mask and the noise leaking bits are then combined to yield 
the echo suppression adaptive mask. 

[024] More particularly, the level of echo signal is calculated based on the 

20 envelope of transmitted signal and echo suppression value using the formula: 

level(echo) = envelope(transmit_signal)/acoustic_coupling_loss_value (e.g. 24dB) (1) 
The value for the mask is chosen based on the estimated echo signal level. The 
number of noise leaking bits is chosen based on the calculated level of background 
noise picked up by the microphone 22. For example, if the estimated echo signal 
25 level is equal to 256 (2 A 8), then the mask needed to completely suppress the echo will 
have 8 LSBs equal to zero (i.e. ECHO_MASK=0xFF00). If the value for the 
estimated noise is equal to 7, the number of bits needed to leak the noise through is 3, 
(i.e. NOISE_MASK = 0x0007). The echo suppression adaptive mask is then 
calculated using the formula: 
30 MASK = ECHO_MASK OR NOISE_MASK (2) 

For the example given above the value of the mask is MASK=0xFF07. 
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[025] During execution of the double-talk detection algorithm, the DSP 60 

calculates the power level of the received echo signal and compares the calculated 
power level with a threshold value. The threshold value is a predefined factor of the 
expected echo signal level, based on the acoustic coupling loss value. In simple 
5 terms, double talk is declared whenever the signal received by the microphone 22 is 
bigger than the expected echo signal which, as discussed above, is equal to the 
envelope of transmitted signal divided by the acoustic coupling loss value. The 
expected echo signal level is determined from the power level of the signal broadcast 
by the handset speaker 20 and the value of the acoustic coupling loss between the 
10 handset speaker 20 and the handset microphone 22. The acoustic coupling loss is 

directly related to the acoustic properties of a given handset. It is basically always the 
same for a specific handset design and specific scenario/application, and has to be 
measured electrically or acoustically using lab equipment. 

[026] For example, a typical handset may be characterized by an acoustic 

15 coupling loss with the handset placed face-down on a hard desk, of-24dB, and if the 
handset is sealed to the ear, the acoustic coupling loss is -50dB. In the algorithm for 
implementing the preferred embodiment of the invention, a value of -24dB is used, 
on the understanding that this value accomodates the worst-case scenario for the given 
handset. If the calculated echo signal power level is greater than the threshold value, 
20 double-talk is declared. When double-talk is declared, all of the bits of the adaptive 
mask are set to one (1). Appendix A shows pseudo-code representing the echo 
suppression, echo signal noise level calculation and double-talk detection algorithms 
executed by the DSP 60. 

[027] Once the echo suppression adaptive mask is established, the echo 

25 suppression adaptive mask is applied to the echo signal received by the handset 
microphone 22 via a multiplier 62. If double-talk has been declared, the echo 
suppressor 32 is effectively switched off allowing the echo signal to pass through the 
echo suppressor 32 unchanged. If double-talk has not been declared, the echo signal 
is cancelled or masked with the exception of background noise that is allowed to pass 
30 through the echo suppressor 32 as a result of the noise leaking bits. 

[028] Appendix B illustrates power level calculation and mask selection 

routines of the echo suppression algorithm used to calculate the power level of the 
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signal broadcast by the handset speaker 20 and to select the appropriate suppression 
mask based on the calculated power level. During execution of the power level 
calculation routine, an envelope following the power level of the signal broadcast by 
the handset speaker 20 is generated using an infinite impulse response (IIR) lowpass 
5 filter. The IIR filter generates the envelope by estimating the long-range average of 
the absolute value of the signal broadcast by the handset speaker 20 and is of the 
form: 

AbsY = (1 -alpha)AbsY + alpha* AbsYO (3) 
Alpha is an IIR filter parameter and is chosen to provide a fast attack time and a slow 
10 decay time for the IIR filter. In the present embodiment, two different values for 
alpha are used, namely alpha_fast and alpha_slow depending on the power level of 
the signal broadcast by the handset speaker 20. Once the envelope has been 
generated, the DSP 60 selects the suppression mask in accordance with the mask 
selection routine. 

1 5 [029] Figure 3 shows an example of an envelope 64 generated by the echo 

suppressor 32 in response to a signal broadcast by the handset speaker 20 where 
alpha_fast = 1 and alpha_slow = 2" 12 . As will be appreciated, by choosing these 
values for alpha, the echo suppressor 32 generates an envelope that reacts fast to 
signals broadcast by the handset speaker 20. The slow decay time on the other hand 

20 compensates for small signal delays and reduces the switching effect when the signals 
fade. 

[030] If desired, the DSP 60 can take actual acoustic coupling between the 

handset speaker 20 and handset microphone 22 into account. In this case, as the 
envelope is generated, the DSP 60 calculates the value of the acoustic coupling loss 

25 between the handset speaker 20 and the handset microphone 22 based on the received 
echo signal and the measured acoustic coupling loss, as discussed above. After the 
value of the acoustic coupling loss has been calculated, the mask selection routine 
selects the suppression mask in accordance with the routine illustrated in Appendix B, 
from which it will be noted that suppression mask is calculated based on the level of 

30 expected echo, which is: 

level(echo) = envelope(transmit_signal)/acoustic_coupling_loss_value (4) 
Thus, a smaller acoustic coupling loss value results in a larger suppression mask. 
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[031] Turning now to Figure 4, an alternative embodiment of the present 

invention is shown. In this embodiment, a conventional echo canceller 70 is in 
parallel with a communication path 71 in which echo signals are generated. The 
communication path 71 may be a telephone device handset as described in the 
5 previous embodiment. Alternatively, the communication path 71 may be a hands free 
telephone, a network which causes network reflections, or another source of echo 
signals. The echo canceller 70 attempts to model the transfer function of the echo 
signal path using a least-mean-squared (LMS) algorithm and generates estimated echo 
signals. The estimated echo signals are subtracted 72 from the echo signals received 

10 by the echo canceller 70 in an attempt to cancel the received echo signals. The 

amount of cancellation is commonly referred to as ERLE. Differences between the 
estimated echo signals and the received echo signals result in residual echo signals. 
The residual echo signals are fed back to the echo canceller 70 so that the echo 
canceller can attempt to converge to the correct transfer function. 

1 5 [032] Unfortunately, the LMS algorithm only monitors linear effects in the 

echo path. It does not deal with non-linear effects caused for example by, clipping, 
telephone key rattling, and frequency shift. Therefore, the echo canceller 70 
converges to a transfer function that approximates the correct transfer function 
resulting in non-zero residual echo signals. These non-zero residual echo signals 

20 result in echo signals in the network. 

[033] To suppress the residual echo signals, a non-linear processor (NLP) 80 

is provided and receives the estimated echo signals output by the echo canceller 70 as 
well as the residual echo signals output by the subtracter 72. The NLP 80 executes 
the echo suppression, echo signal noise level calculation and double-talk detection 

25 algorithms described previously. The leaky masks generated by the NLP 80 are 
combined with the residual echo signals to suppress the residual echos with the 
exception of background noise which is allowed to pass through the NLP 80 as a 
result of the noise leaking bits. 

[034] Taking actual acoustic coupling into account is particularly of interest 

30 in this environment especially in situations where the existence of double-talk is 
indeterminate. The NLP 80 needs to be active only during the echo periods, and 
should be switched off during double-silence and double-talk periods. Since it is not 
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a straightforward matter to correctly determine exactly when the NLP should be 
switched on, it is important to reduce the speech degradation in cases when the NLP 
80 has been incorrectly switched on during periods of double-talk. Some NLP 
designs totally cut-off the signal when echo is present and replace the signal with 
5 regenerated noise or even silence. Obviously, if such an NLP is active when near end 
speech signal is present, instead of echo, the speech will be cut-off and distorted. 
According to the present invention, the large mask introduces more distortion to the 
speech than a smaller mask, because it clears more bits of information. 
[035] Although the preferred embodiment of the echo suppressor 32 is 

1 0 described with reference to a programmable DSP, those of skill in the art will 

appreciate that alternative implementations can be used. For example, the algorithms 
may be run on a hardware ASIC (i.e. a non-programmable digital processing device) 
or other comparable hardware device. 

[036] Although preferred embodiments of the present invention have been 

1 5 described, those of skill in the art will appreciate that variations and modifications 
may be made without departing from the spirit and scope thereof as defined by the 
appended claims. 
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APPENDIX A 

Inputs: transmitted_signal(l), received_signal (2) 
Output: output_signal going to the network (3) 
5 Start: 

Calculate the power of transmitted-signal; 

AbsY = (l-alpha)AbsY + alpha*abs(transmittedsignal); 

Chose the mask that corresponds to the power of the transmitted-signal; 

Echo_Mask = Mask_select(AbsY, Acoustic_Coupling_Loss); 
10 Calculate the noise level of received-signal 

noise_level = NoiseLevelCal(received_signal); 

Noise_Mask =0; 

while noise level >0 
Noise_Mask =( NoiseJVlask *2)+l; 
1 5 noise_level =fix(noise_level /2); 

end 

Mask = mask_combine(Echo_Mask, Noise_Mask); 
/* Double-Talk calculation */ 
Calculate the power of the received signal; 
20 AbsY_echo = (1 -alpha) AbsY_echo + alpha*abs(received_signal); 
if (AbsY/Acoustic_Coupling_Loss < AbsY_echo) 

dt=l; /* Double-talk is declared*/ 
else 

dt=0; /* Single-Talk (Echo) is declared*/ 
25 end 

if(dt=l) 

Mask = OxFFFF; 
end 

If received_signal is positive 
30 Output_signal = received_signal AND Mask; 
If received_signal is negative 

Output_signal =-(abs(received_signal) AND Mask); 
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Go to Start; 
where: 

transmittedsignal is the signal received by the telephone device to be 
broadcast by the handset speaker; 

received_signal is the echo signal picked up by the handset 

microphone; 

alpha is an IIR filter parameter; and 

Output_signal is the signal output to the network by the telephone 

device. 



-14- 



APPENDIX B 



10 



Power Level Calculation Routine 

if AbsY > AbsYO 

Abs Y=( 1 - alpha_slow)*AbsY + alpha_slow * AbsYO; 
else 

AbsY=(l- alpha_fast)*AbsY + alpha_fast * AbsYO; 
end 

Mask Selection Routine 



15 



20 



25 



30 



35 



40 



Mask_select: 

Mask = 64512; 
if AbsY < 4063 
Mask = 65024; 
end 

if AbsY < 2031 

Mask= 65280; %ff00 or 8 zeros 
end 

if AbsY < 1015 

Mask= 65408; %ff80 or 7 zeros 
end 

if AbsY < 507 

Mask= 65472; %ffc0 or 6 zeros 
end 

if AbsY < 253 

Mask = 65504; %ffe0 or 5 zeros 
end 

if AbsY < 126 

Mask= 65520; %fff0 or 4 zeros 
end 

if AbsY < 63 

Mask = 65528; %fff8 or 3 zeros 
end 

if AbsY < 31 

Mask = 65532; %fffc or 2 zeros 
end 

if AbsY < 15 

Mask = 65534; %fffe or 1 zero 
end 



%fc00 or 1 0 zeros (111111 0000000000) 
%fe00 or 9 zeros 



